Person, Organization, or Personage: Towards User Account Type Prediction in Microblogs
Abstract
Over the past decade, millions of business and private users have adopted microblog services as one of the most powerful information broadcasting tools. Twitter, for example, has attracted many social science researchers because of its high popularity, its constrained format of thought expression, and its ability to reflect actual trends. However, unstructured microblog data often lack representativeness because of the large amount of noise they contain. Much of this noise comes from organizational and fake user accounts, whose activity may not be useful in many application domains. To tackle this information filtering problem, in this paper we classify Twitter accounts into three categories: “Personal”, “Organization” and “Personage”. Specifically, we use various text-based data representation approaches to extract features for our proposed microblog account type prediction framework, “POP-MAP”. To study the problem at a cross-language level, we harvested and learned from a multilingual Twitter dataset, which allows us to achieve better classification performance than various state-of-the-art baselines.
Similar resources
News Feature Extraction for Events on Social Network Platforms
Microblog-based social network platforms like Twitter and Sina Weibo have been important sources for news event extraction. However, existing works on microblog event extraction, which usually use keywords, entities, or selected microblogs to represent events, are not able to extract details of an event. From the viewpoint of a news report, an event should present detailed news features, i.e., whe...
Prediction in Social Media for Monitoring and Recommendation
Title of dissertation: PREDICTION IN SOCIAL MEDIA FOR MONITORING AND RECOMMENDATION Shanchan Wu, Doctor of Philosophy, 2012 Dissertation directed by: Professor Louiqa Raschid Department of Computer Science Social media including blogs and microblogs provide a rich window into user online activity. Monitoring social media datasets can be expensive due to the scale and inherent noise in such data...
Towards a Microblogs Data Management System (Invited Industrial Paper)
This paper advocates for the need to build a Microblogs Data Management System (MDMS) as an end-to-end data management system to support indexing, querying, and analyzing microblogs, e.g., tweets, comments, or check-ins. We identify a set of characteristics that distinguish microblogging environments from any other data management environment. Then, we propose a system architecture f...
NATIONAL UNIVERSITY OF SINGAPORE School of Computing PH.D DEFENCE - PUBLIC SEMINAR
Microblogging services have revolutionized the way people exchange information, and have emerged as an essential forum for people to air their views on topics of common interest. Therefore, monitoring and analyzing the rich and continuous flow of user-generated content in microblog networks can yield unprecedentedly valuable information, which would not have been available from traditional me...
Predicting Popularity of Microblogs in Emerging Disease Event
During emerging disease outbreaks, massive amounts of information are disseminated through social networks. In China, the Sina microblog system, as the biggest social network, provides a novel way to monitor the development of emerging diseases and public awareness. However, only a small percentage of microblogs spread widely. Therefore, predicting the popularity of microblogs in a timely manner is meaningful for emergency m...